Paper 928 Complex Group-By Queries for XML
نویسندگان
چکیده
The popularity of XML as a data exchange standard has led to the emergence of powerful XML query languages like XQuery [21] and studies on XML query optimization. Of late, there is considerable interest in analytical processing of XML data (e.g.,[2, 3]). As pointed out by Borkar and Carey in [3], even for data integration, there is a compelling need for performing various group-by style aggregate operations. A core operator needed for analytics is the groupby operator, which is widely used in relational as well as OLAP database applications. XQuery requires group-by operations to be simulated using nesting [2]. Studies addressing the need for XML grouping fall into two broad categories: (1) Provide support for grouping at the logical or physical level [6] and recognize grouping operations from nested queries and rewrite them with grouping operations [4, 5, 9, 12]. (2) Extend XQuery FLWOR expressions with explicit constructs similar to the group-by, order-by and having clauses in SQL [3, 2]. However, direct algorithmic support for a group-by operator is not explored. In this paper, we focus on efficient processing of a groupby operator for XML – with the additional goal of supporting a full spectrum of aggregation operations, including holistic ones such as median() [8] and complex nested aggregations, together with having clause, as well as moving window aggregation. Consider the simple catalogue example in Figure 1. This can be part of an input XML database, or intermediate result of a query. The catalogue is heterogeneous: it contains information about books, music CDs, etc. Books are organized by Subject, e.g., physics, chemistry. For each book, there is information on its Title, Author, Year, #Sold, Price, (publisher) Name, etc. Books may have multiple authors. The data value at a leaf node is shown in italics. The node id of a node is also shown for future discussion. Consider the following nested group-by query Q1. While we could follow the syntax proposed by [2], syntax not being our main focus, we use a more concise form. We also omit the selection part of the query, and just focus on the aggregation part.
منابع مشابه
Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...
متن کاملApply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...
متن کاملEfficient XML - to - SQL Query Translation : Where to Add the Intelligence ? ( Extended
Exporting XML views of relational data gives rise to the problem of translating XML queries into SQL. To date, the focus of most of the work in the published literature [9, 14, 20] has been on mechanisms for correctly translating complex XML queries into SQL queries, with less emphasis on evaluating the quality of the resulting SQL queries. The efficiency of the SQL queries generated by the tra...
متن کاملXML and the VITAL Standard: The Document-oriented Approach for Open Telemedicine Applications
This paper describes an effort to create a common, document-oriented architecture for the interchange of medical data in healthcare telemedicine applications. Key components are: The VITAL standard specifying a common (medical device independent) representation of Vital Signs Information and the Extensible Markup Language (XML) specifying the document specifications form, an architecture that, ...
متن کاملQuerying XML Views of Relational Data
XML has emerged as the standard data exchange format for Internet-based business applications. This has created the need to publish existing business data, stored in relational databases, as XML. A general way to publish relational data as XML is to provide XML views over relational data, and allow business partners to query these views using an XML query language. In this paper, we address the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006